Enzyme catalyzed template-independent creation of phosphodiester bonds using protected nucleotides

ABSTRACT

A method for the stepwise creation of phosphodiester bonds between desired nucleosides resulting in the synthesis of polynucleotides having a predetermined nucleotide sequence by preparing an initiation substrate containing a free and unmodified 3&#39;-hydroxyl group; attaching a mononucleotide selected according to the order of the predetermined nucleotide sequence to the 3&#39;-hydroxyl of the initiating substrate in a solution containing a catalytic amount of an enzyme capable of catalyzing the 5&#39; to 3&#39; phosphodiester linkage of the 5&#39;-phosphate of the mononucleotide to the 3&#39;-hydroxyl of the initiating substrate, wherein the mononucleotide contains a protected 3&#39;-hydroxyl group, whereby the protected mononucleotide is covalently linked to the initiating substrate and further additions are hindered by the 3&#39;-hydroxyl protecting group. Methods in which a mononucleotide immobilized on a solid support is added to a free polynucleotide chain are also disclosed.

TECHNICAL FIELD

This invention relates to the synthesis of oligonucleotides and other nucleic acid polymers using template independent enzymes.

BACKGROUND OF THE INVENTION

Oligonucleotides are presently synthesized in vitro using organic synthesis methods. These methods include the phophoramidite method described in Adams et al., J. Amer. Chem. Soc., 105:661 (1983) and Froehler et al., Tetrahedron Lett., 24:3171 (1983) and the phosphotriester method described in German Offenlegungsshrift 264432. Other organic synthesis methods include those described by Froehler et al., U.S. Pat. No. 5,264,566 in which H-phosphonates are used to produce oligonucleotides.

The phosphoramidite method of phosphodiester bond formation and oligonucleotide synthesis represents the current state of the art employed by most laboratories for the coupling of desired nucleotides without the use of a template. In this method, the coupling reaction is initiated by a nucleoside attached to a solid support. The 5'-hydroxyl group of the immobilized nucleoside is free for coupling with the second nucleoside of the chain to be assembled. Since the growing oligonucleotide chain projects a 5'-hydroxyl available for reaction with a mononucleotide, the direction of synthesis if referred to as 3' to 5'.

Each successive mononucleotide to be added to the growing oligonucleotide chain contains a 3' -phosphoramidate moiety which reacts with the 5'-hydroxyl group of the support-bound nucleotide to form a 5' to 3' internucleotide phosphodiester bond. The 5'-hydroxyl group of the incoming mononucleotide is protected, usually by a trityl group, in order to prevent the uncontrolled polymerization of the nucleosides. After each incoming nucleoside is added, the protected 5'-hydroxyl group is deprotected, so that it is available for reaction with the next incoming nucleoside having a 3'-phosphoramidite group and a protected 5'-hydroxyl. This is followed by deprotection and addition of the next incoming nucleotide, and so forth.

Between each nucleoside addition step, unreacted chains which fail to participate in phosphodiester bond formation with the desired nucleoside are chemically "capped" to prevent their further elongation. This usually involves chemical acetylation.

This method and the other currently used organic methods while widely accepted require large amounts of costly monomers that require complex organic synthesis schemes to produce. In addition, these methods are complex in that the phosphoramidite method requires an oxidation step after each condensation reaction. The phosphotriester method requires that the subpopulation of oligonucleotides that have not had a monomer added in a particular cycle be capped in a separate reaction to prevent further chain elongation of these oligonucleotides.

Other drawbacks of virtually all chemical methods of phosphodiester bond formation, is that the reaction must be performed in organic solvents and in the absence of water. Many of these organic solvents are toxic or otherwise hazardous. Another drawback to chemical synthesis is that it is at best 98 percent efficient at each cycle. In other words, following each nucleotide addition, at least 2 percent of the growing oligonucleotide chains are capped, resulting in a yield loss. The total yield loss for the nucleotide chain being synthesized thus increases with each nucleotide added to the sequence.

For example, assuming a yield of 98 percent per nucleotide addition, the synthesis of a polynucleotide consisting of 70 mononucleotides would experience a yield loss of nearly 75 percent. Furthermore, the object nucleotide chain would require isolation from a reaction mixture of polynucleotides, nearly 75 percent of which consist of capped oligonucleotides ranging between 1 and 69 nucleotides in length.

This inherent inefficiency in chemical synthesis of oligonucleotides ultimately limits the length of oligonucleotide that can be efficiently produced to oligonucleotides having 50 nucleic acid residues or less.

A need exists for a method which improves the efficiency of phosphodiester bond formation and which could ultimately be capable of producing shorter chain oligonucleotides in higher yields and longer chain polynucleotides in acceptable yields. In addition, a need exists for a polynucleotide synthesis system which is compatible with pre-existing polynucleotides, such as vector DNAs, so that desired polynucleotide sequences can readily be added on to the pre-existing sequences. Chemical coupling by the phosphoramidite method is not compatible with "add-on" synthesis to pre-existing polynucleotides. Enzyme catalyzed phosphodiester bond formation, however, can be performed in an aqueous environment utilizing either single or double stranded oligo- or polynucleotides to initiate the reaction. These reaction conditions also greatly minimize the use of toxic and hazardous materials.

The 3' to 5' direction of synthesis inherent to the phosphoramidite method of phosphodiester bond formation cannot be enzyme catalyzed. All known enzymes capable of catalyzing the formation of phosphodiester bonds do so in the 5' to 3' direction since the growing polynucleotide strand always projects a 3'-hydroxyl available for attachment of the next nucleoside.

There are many enzymes capable of catalyzing the formation of phosphodiester bonds. One class of such enzymes, the polymerases, are largely template dependent in that they add a complementary nucleotide to the 3' hydroxyl of the growing strand of a double stranded polynucleotide. However, some polymerases are template independent and primarily catalyze the formation of single stranded nucleotide polymers. Another class of enzyme, the ligases, are template independent and form a phosphodiester bond between two polynucleotides or between a polynucleotide and a mononucleotide.

Addition of single nucleotides to DNA fragments, catalyzed by deoxynucleotidyl terminal transferase (TdTase), has previously been described by Deng and Wu, Meth. Enzymol., 100:96-116, 1983. These reaction conditions did not involve transient protection of the 3'-hydroxyl nor were they intended to be used for the sequential creation of phosphodiester bonds to synthesize a predetermined nucleotide sequence. The presence of unprotected 3'-hydroxyls resulted in a highly heterogeneous population of reaction products.

Similarly, prior attempts to catalyze synthesis of very short pieces of RNA or DNA using protected nucleotide monophosphates or diphosphates resulted in unacceptably low levels of the desired phosphodiester bond formation or required excessive amounts of enzyme to achieve acceptable efficiencies. These problems were largely due to unavoidable heterogeneity of the mononucleotide building blocks or to the very high turnover number of the enzyme, necessitating extremely long incubation times (see, for example, Hinton and Gumport, Nucleic Acids Res. 7:453-464, 1979; Kaufman et al., Eur. J. Biochem., 24: 4-11, 1971). These experiments were limited to 5'-monophosphates and diphosphates. No attempts have been made to catalyze controlled DNA synthesis using 5'-triphosphates protected at the 3' position.

Enzyme catalyzed creation of a single phosphodiester bond between the free 3'-hydroxyl group of an oligonucleotide chain and the 5'-phosphate of a mononucleotide requires protection of the 3'-hydroxyl of the mononucleotide in order to prevent multiple phosphodiester bond formations and hence repeated mononucleotide additions. Protection of the 3'-hydroxyl of the mononucleotide ideally involves a transient blocking group which can readily be removed in order to allow subsequent reactions. Flugel et al., Biochem. Biophys. Acta. 308:35-40, 1973, report that nucleoside triphosphates with blocked 3'-hydroxyl groups cannot be prepared directly. This lack of 3' blocked triphosphates necessitated previous processes to utilize lower energy and thus more inefficient 3' blocked monophosphates and diposphates. Synthetic techniques to create 3' block triphosphates would be highly desirable, because this could enable stepwise enzyme catalyzed phosphodiester bond formation leading to polynucleotide synthesis.

These prior attempts at synthesizing oligonucleotides using a template independent polymerase were extremely inefficient resulting in the production of very short oligonucleotides. The inefficiency of these methods made these methods useless for practical synthesis of oligonucleotides.

The present invention allows the creation of phosphodiester bonds between nucleotides using a template independent polymerase to create oligonucleotides having a predetermined sequence. This enzyme catalysis can vastly improve the efficiency of phosphodiester bond formation between desired nucleotides compared to current techniques of chemical coupling and can be carried out in the presence of other biological molecules such as pre-existing sequences of single or double stranded DNA as well as other types of enzymes. In addition, the very high specificity inherent to enzyme catalysis allows only coupling of a 5'-phosphate to a 3'-hydroxyl. The coupling of two mononucleosides, as well as various other side reactions inherent to chemical coupling techniques, simply do not occur.

A further advantage of the present invention is realized by using 3' blocked triphosphates having high energy phosphate bonds which an enzyme can utilize to drive the reaction to greater completion level than when other monophosphates and diphosphates are used. In addition, triphosphates are less strongly hydrated than the diphosphate, which also tends to drive catalytic hydrolysis of the triphosphate to completion.

Clearly, the availability of a homogeneous population of protected mononucleotide triphosphates and enzymes capable of efficiently joining protected nucleotides to initiating substrates will enable the creation of a highly uniform population of synthetic polynucleotides resulting from stepwise phosphodiester bond formation.

SUMMARY OF THE INVENTION

A number of methods have been discovered by which the 3'-hydroxyl group of a deoxynucleotide triphosphate can be effectively protected and deprotected and wherein the protected nucleotide is utilized by a template independent polymerase to create a phosphodiester bond permitting the synthesis of oligonucleotides or polynucleotides having a desired predetermined sequence.

Therefore, in accordance with the present invention, a method is provided for the synthesis of a polynucleotide of a predetermined sequence of which method includes the steps of:

A. providing an initiating substrate comprising a nucleoside having an unprotected 3'-hydroxyl group; and

B. reacting under enzymatic conditions in the presence of a catalytic amount of an enzyme the 3'-hydroxyl group of the initiating substrate with a nucleoside 5'-triphosphate having a removable blocking moiety protecting the 3' position of the nucleoside 5'-triphosphate and selected according to the order of the predetermined sequence, so that enzyme catalyzes the formation of a 5' to 3' phosphodiester linkage between the unprotected 3'-hydroxyl group of the initiating substrate and the 5'-phosphate of the nucleoside 5'-triphosphate to produce the polynucleotide.

In other embodiments of the present invention, the method further comprises the step:

C. removing the blocking moiety protecting the 3' position of said nucleotide 5'-triphosphate to produce an initiating substrate having an unprotected 3'-hydroxyl group.

In other embodiments, steps (b) and (c) are repeated at least once to add additional nucleotides to the initiating substrate by alternatively adding a nucleoside 5'-triphosphate with a removable blocking moiety at its 3' position, deblocking the 3' position of the terminal nucleoside and then adding another nucleoside 5'-triphosphate with a removable blocking group at its 3' position. Repetition of steps (b) and (c) can also be carried out to produce an oligonucleotide or polynucleotide having a predetermined sequence.

The present invention contemplates initiating substrates that are deoxynucleosides, nucleotides, single or double stranded oligonucleotides, single or double stranded polynucleotides, oligonucleotides attached to nonnucleoside molecules and the like.

The present invention contemplates embodiments in which the substrate is immobilized on a solid support. Preferred solid supports include cellulose, Sepharose, controlled-pore glass, silica, Fractosil, polystyrene, styrene divinyl benzene, agarose, and crosslinked agarose and the like.

The present invention contemplates the use of template independent polynucleotide polymerases such as terminal deoxynucleotidyl transferase from any number of sources including eukaryotes and protharyotes.

The methods of the present invention utilize removable blocking moieties that block the 3' position of nucleoside 5'-triphosphates used in the methods. Preferred removable blocking moieties can be removed in under 10 minutes to produce a hydroxyl group at the 3' position of the 3' nucleoside. Removable blocking groups contemplated include carbonitriles, phosphates, carbonates, carbamates, esters, ethers, borates, nitrates, sugars, phosphoramidates, phenylsulfenates, sulfates and sulfones.

The methods of the present invention contemplate removing the removable blocking moiety using a deblocking solution that preferably contains divalent cations such as Co++ and a biological buffer such as comprises a buffer selected from the group consisting of dimethylarsinic acid, tris[hydroxymethyl] amino methane, and 3-[m-morpholine] propanosulphonic acid. Other embodiments of the present invention utilize an enzyme present in the deblocking solution to remove the removable blocking moiety.

The present invention also contemplates methods in which the nucleoside 5'-triphosphate having the removable blocking moiety at its 3' position is immobilized in a solid support and reacted with free initiating substrates. The solid support is linked to the nucleoside 5'-triphosphate at the 3'-hydroxyl group, thereby acting as a removable blocking moiety at the 3' position. Attachment of the nucleoside to the support is transient, thereby enabling the release of the newly synthesized product from the support and regeneration of the free and unmodified 3'-hydroxyl to allow the next nucleotide addition to occur.

Thus, in some embodiments of the present invention the deblocking solution would remove the removable blocking moiety at the position of the nucleoside and thus release the growing polynucleotide from the solid support.

The present invention also includes polynucleotides having a predetermined sequence provided according to the methods of this invention. Applications for using polynucleotides and oligonucleotides of the present invention in molecular cloning and/or expression of genes, peptides or proteins.

Also contemplated by the present invention are compositions of matter comprising a catalytic amount of a template independent enzyme and a nucleoside 5'-triphosphate having a removable blocking moiety protecting the 3' position of said nucleoside 5'-triphosphate. Additional compositions of matter further comprising an initiating substrate are also contemplated.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1. A diagram showing enzymatic synthesis of an oligonucleotide using a template independent polymerase and a nucleoside 5' triphosphate having a removable blocking moiety at its 3' position is shown.

FIG. 2. A diagram of a nucleotide having a removable blocking moiety at its 3' position is shown.

FIG. 3. A diagram showing an apparatus for automating the enzymatic synthesis of polynucleotides is shown.

FIG. 4. A diagram showing an apparatus for automating the enzymatic synthesis of polynucleotides is shown.

BRIEF DESCRIPTION OF THE INVENTION

A. Definitions

DNA: Deoxyribonucleic acid.

RNA: Ribonucleic acid.

Nucleotide: A subunit of a nucleic acid comprising a phosphate group, a 5-carbon sugar and nitrogen containing base. In RNA, the 5-carbon sugar is ribose. In DNA, it is a 2-deoxyribose. The term also includes analogs of such subunits.

Nucleoside: Includes a nucleosidyl unit and is used interchangeably therewith, and refers to a subunit of a nucleic acid which comprises a 5-carbon sugar and a nitrogen containing base. The term includes not only those nucleosidyl units having A, G, C, T and U as their bases, but also analogs and modified forms of the naturally-occurring bases, such as pseudoisocytosine and pseudouracil and other modified bases (such as 8-substituted purines). In RNA, the 5-carbon sugar is ribose; in DNA, it is 2'-deoxyribose. The term nucleoside also includes other analogs of such subunits, including those which have modified sugars such as 2'-O-alkyl ribose.

Polynucleotide: A nucleotide multimer generally about 50 nucleotides or more in length. These are usually of biological origin or are obtained by enzymatic means. ##STR1## wherein phosphodiester groups may be used as internucleosidyl phosphorus group linkages (or links) to connect nucleosidyl units.

Non-nucleoside monomeric unit: A monomeric unit wherein the base, the sugar and/or the phosphorus backbone or other internuclosidyl linkage of a nucleoside has been replaced by other chemical moieties.

Polypeptide and Peptide: A linear series of amino acid residues connected on to the other by peptide bonds between the alpha-amino and carboxyl groups of adjacent residues.

Protein: A linear series of greater than about 50 amino acid residues connected one to the other as in a polypeptide.

Gene: A segment of DNA coding for an RNA transcript that is itself a structural RNA, such as ribosomal RNA or codes for a polypeptide. The segment of DNA is also equipped with a suitable promoter, termination sequence and optionally other regulatory DNA sequences.

Structural Gene: A gene coding for a structural RNA and being equipped with a suitable promoter, termination sequence and optionally other regulatory DNA sequences.

Promoter: A recognition site on a DNA sequence or group of DNA sequences that provide an expression control element for a gene and to which RNA polymerase specifically binds and initiates RNA synthesis (transcription) of that gene.

Oligonucleotide: A chain of nucleosides which are linked by internucleoside linkages which is generally from about 2 to about 50 nucleosides in length. They may be chemically synthesized from nucleoside monomers or produced by enzymatic means. The term oligonucleotide refers to a chain of nucleosides which have internucleosidyl linkages linking the nucleoside monomer and, thus, includes oligonucleotide containing nucleoside analogs, oligonucleotide having internucleosidyl linkages such that one or more of the phosphorous group linkages between monomeric units has been replaced by a non-phosphorous linkage such as a formacetal linkage, a thioformacetal linkage, a sulfamate linkage, or a carbamate linkage. It also includes nucleoside/non-nucleoside polymers wherein both the sugar and the phosphorous moiety have been replaced or modified such as mopholino base analogs, or polyamide base analogs. It also includes nucleoside/non-nucleoside polymers wherein the vase, the sugar, and the phosphate backbone of a nucleoside are either replaced by a non-nucleoside moiety or wherein a non-nucleoside moiety is inserted into the nucleoside/non-nucleoside polymer. Thus an oligonucleotide may be partially or entirely phophonothioates, phosphorothioate phosphorodithioate phosphoramidate or neutral phosphate ester such as phosphotriesters oligonucleotide analogs.

Removable Blocking Moiety: A removable blocking moiety is a moiety which is attached to the oxygen at the 3' position of a nucleoside or the equivalent position in a nucleoside analog. The removable blocking moiety prevents reaction of the 3' oxygen when present and is removable under deblocking conditions so that the 3' oxygen can then participate in a chemical reaction.

A. Methods

Generally, the present invention provides methods for synthesizing oligonucleotides and polynucleotides having a predetermined sequence using a template independent polymerase and nucleoside having the 3' position blocked with a removable blocking moiety so that single nucleosides are added to the growing oligonucleotide. Single nucleosides are added to the growing chain by removing the blocking moiety at the 3' position of the terminal nucleoside of the growing oligonucleotide so that the next blocked nucleoside can be added to the oligonucleotide. This process is then repeated until the oligonucleotide having the predetermined sequence is produced.

Thus, in accordance with this embodiment of the present invention, a method comprises the steps of:

(a) providing an initiating substrate comprising a nucleoside having an unprotected 3'-hydroxyl group; and

(b) reacting under enzymatic conditions in the presence of a catalytic amount of an enzyme said 3'-hydroxyl group of said initiating substrate with a nucleoside 5'-triphosphate having a removable blocking moiety protecting the 3' position of said nucleoside 5'- triphosphate and selected according to the order of said predetermined sequence, whereby said enzyme catalyzes the formation of a 5' to 3' phophodiester linkage between said unprotected 3'-hydroxyl group of said initiating substrate and the 5'-phosphate of said nucleoside 5'-triphosphate to produce said polynucleotide.

In preferred embodiments, the methods of the present invention further comprises the step of:

(c) removing the blocking moiety protecting the 3' position of said nucleoside 5'-triphosphate to produce an initiating substrate having an unprotected 3'-hydroxyl group.

This additional step regenerates a reactive atom at the 3' position of the terminal nucleoside so that this atom can be used to form a bond with the next nucleoside and thus extend the length of the oligonucleotide by one nucleoside.

The methods of the present invention also include methods in which the above steps (b) and (c) are repeated at least once to produce an oligonucleotide. This process can be repeated many times to produce oligonucleotides of selected length. This process can also be repeated many times such that each particular nucleoside added to the oligonucleotide having a preselected sequence.

1. Initiating Substrates

An initiating substrate of the present invention is prepared containing a nucleoside with a free and unmodified 3'-hydroxyl group. As is well understood by those of ordinary skill in the art, nucleotide derivatives of the nucleosides adenosine, cytidine, guanosine, uridine and thymidine can be assembled to form oligonucleotides acid polynucleotides. According to the method of the present invention, the initiating substrate may contain a single nucleoside having a free and unmodified 3'-hydroxyl group, or a preassembled oligo- or polynucleotide may be provided as an initiating substrate, so long as the oligo- or polynucleotide has a free and unmodified 3'-hydroxyl group.

One skilled in the art will understand that an initiating substrate could be provided in a form in which a nucleoside has a removable blocking moiety at its 3' position which is subsequently removed using a deblocking process so that the initiating substrate now has the free unprotected 3' hydroxyl group useful in the present invention.

The initiating substrates of the present invention include the termini of polynucleotides frequently generated and used in various cloning and molecular biology techniques. Examples of these initiating substrates include the termini of DNA or RNA vectors, single-stranded or double-stranded fragments, single-stranded or double-stranded RNA fragments and RNA or DNA oligonucleotides.

In the preferred embodiments, initiating substrates will consist wholly or in part of an oligo- or polynucleotide. The initiating substrate can be any arrangement of nucleosides which enables the enzyme to create a phosphodiester bond between the 3'-hydroxyl of a nucleoside and the 5'-phosphate of a mononucleotide. Initiating substrates may be based wholly or in part on ribonucleic acids (RNA) or deoxyribonucleic acids (DNA) and may be single stranded or multi-stranded. In addition, initiating substrates can include other types of naturally occurring or synthetic molecules (non-nucleosides) which may enable or enhance the ability of the enzyme to create a phosphodiester bond or which may facilitate the manipulation of reaction components and by-products. An example of this would be a linker molecule (commonly used linkers consist of C, O, N, and H e.g. Affi-Gel™ 10: R--OCH₂ CONH (CH₂)₂ NHCO(CH₂)₂ COON(CH₂)₂ which would serve to provide a convenient method for attaching an initiating substrate to a solid support.

The sequential creation of phosphodiester bonds and hence the addition of nucleotides to the initiating substrate may be performed entirely in solution, or the initiating substrate may be attached to an insoluble matrix. Attachment to an insoluble matrix will permit the rapid separation of the substrate from unreacted reagents in order to prepare the substrate for the addition of the next nucleotide. For this reason, the substrate is preferably affixed to a solid support matrix during each reaction creating a phosphodiester bond.

Insoluble matrices suitable for use as solid supports include cellulose, Sepharose™, controlled-pore glass (CPG), polystyrene, silica, agarose, and the like.

Reagents, buffers and solvents suitable for use with the present invention are capable of flowing through the solid support matrix, by which means the initiating substrate is brought into contact with these materials. The growing nucleotide chain remains attached to the solid support as the various reagents, buffers and solvents sequentially flow therethrough. The solid support matrix is preferably contained within a synthesis column, to which reagents, buffers and solvents are provided.

Attachment of the initiating substrate to the solid support can be by covalent bonding. Numerous methods for the covalent attachment of molecules to insoluble matrices have been described and are well understood by those of ordinary skill in the art. In the preferred embodiment an oligonucleotide chain may be linked to alkylamine derivatized polystyrene or CPG by way of a covalent phosphoramidate bond although numerous strategies for linking oligonucleotides to solid supports have been described. The choice of an appropriate linking strategy will depend on the specific requirements of stability, charge interactions, solubility and the like.

Alternatively, attachment of the initiating substrate to the solid support can be by non-covalent interactions. Numerous methods for the transient attachment of molecules to insoluble matrices have been described and are well understood by those of ordinary skill in the art. For example, an oligonucleotide derivative containing single or multiple biotin molecules may be attached to avidin-agarose or streptavidin-agarose to form a non-covalent linkage between the oligonucleotide and the insoluble agarose matrix.

In general, it is envisioned that single and double stranded oligo- and polynucleotides based on DNA or RNA may be covalently or non-covalently bound to solid supports to form a variety of initiating substrates. Regardless of the strategy employed to attach an initiating substrate to an insoluble matrix, a nucleoside with a free and unmodified 3'-hydroxyl group will always be available for enzyme catalyzed creation of a phosphodiester bond.

2. Template Independent Enzymes

Mononucleotides are added to the free and unmodified 3'-hydroxyl group of the initiating substrate by reacting the substrate with the 5'-phosphate of the selected mononucleotide in the presence of a catalytic amount of an enzyme capable of creating the phosphodiester bond covalently linking the 5'-phosphate of the mononucleotide with the 3'-hydroxyl of the substrate. The enzyme is preferable a template independent enzyme such as a template independent polynucleotide polymerase. Template independent enzymes such as template independent polynucleotide polymerases are capable of catalyzing the formation of a phosphodiester bond between the nucleotides without requiring the presence of a complementary nucleotide strand for activity. Thus, the template independent enzymes such as template independent polynucleotide polymerases are able to catalyze the formation of single-stranded nucleic acid polymers without requiring a complementary nucleic acid strand to act as a template. Examples of template independent polynucleotide polymerases include terminal deoxynucleotidyl transferases. Template independent polynucleotide polymerases can be isolated from a number of sources including calf thymus and other sources of lymphocytes. A particularly preferred polymerase is terminal deoxynucleotidyl transferase (TdTase, EC 2.7.7.31).

Enzymes capable of being utilized with the present invention can be readily identified by those of ordinary skill in the art, and are employed under appropriate and well understood conditions. Example enzymatic conditions for deoxynucleotidyl transferase include a pH of 6.8 maintained by a potassium cacodylate buffer, 8 mmol/l of MgCl₂, 1 mmol of β mercaptoethambol, 0.33 mmol/l of ZnSO₄. One skilled in the art will understand that these enzymatic conditions may vary while still allowing the enzyme to catalyze the desired reaction.

3. Nucleosides Having Removable Blocking Moieties

In accordance with the present invention, the mononucleotide has its 3' position protected by a removable blocking moiety so that a single phosphodiester linkage is formed between the free 3'-hydroxyl of the initiating substrate and the 5'-phosphate group of the mononucleotide. The removable blocking moiety protecting the 3' position of the mononucleotide prevents the catalytic creation of multiple phosphodiester bonds and hence multiple nucleotide additions.

Nucleotides having a removable blocking moiety protecting the 3' position suitable for use with the present invention have a structure corresponding to Formula 1, that has a structure which is compatible with the utilization of the entire nucleotide for the creation of a phosphodiester bond by the enzyme. ##STR2##

B is the nucleotide base and R₂ represents the appropriate mono-, di- or triphosphate. R₁ can be an ester linkage, COR₁ ', which forms the structure nucleotide-3'--O--CO--R₁ '. R₁ '0 can be any alkyl or aryl group compatible with the utilization of the molecule by the enzyme for the creation of an internucleotide phosphodiester bond. The chemistry of esters as protecting groups for hydroxyls is well established. Removable blocking moieties including formate, benzoyl formate, acetate, substituted acetate, propionate, isobutyrate, levulinate, crotonate, benzoate, napthoate and many other esters have been described in detail (See, Greene, T. W., Protective Groups in Organic Chemistry, John Wiley & Sons, New York, 1981). Esters in general are readily removed, usually in the presence of base, to regenerate the hydroxyl group and thus are useful as removable blocking moieties.

Ester removable blocking moieties are formed by reacting the nucleotide with the appropriate acid anhydride. Alternatively, a carboxylic acid can be esterified with the 3'-hydroxyl of the nucleotide in the presence of water after activation by reaction with carbonyl diimidazole (See, Schafer et al., Meth Enzymol., 126, 682-712.)

An alternative type of removable blocking moiety utilizes an ether linkage which forms the structure nucleotide-3'--O--R'₁. In this instance R'₁ can be methyl, substituted meythyl, ethyl, substituted ethyl, butyl, allyl, cinnamyl, benzyl, substituted benzyl, anthryl or silyl. The chemistry involved in using ethers as removable blocking moieties for hydroxyls is well known in the art. Numerous ethers have been described and are useful for transiently protecting hydroxyls and similar chemical groups.

Additional well known removable blocking moieties useful for protecting for hydroxyls include carbonitriles, phosphates, carbonates, carbamates, borates, nitrates, phosphoramidates, and phenylsulfenates. Most of these chemical modifications to the nucleotide can be removed by chemical reactions. Some modifications may also be removed by enzymatic digestion resulting in the regeneration of the 3' hydroxyl. These would include phosphates, glycosides, and certain esters.

Attachment of the nucleotide having a removable blocking moiety protecting the 3'- position to the free and unmodified 3'-hydroxyl of the initiating substrate is then accomplished by reacting [incubating] the aforementioned nucleotide and the substrate with an enzyme capable of forming a phosphodiester bond between the two. Specifically, this bond would link the 5'-phosphate of the mononucleotide with the 3'-hydroxyl of the initiating substrate. This reaction can be performed either free in solution or, in one embodiment of the invention, the initiating substrate is immobilized on a solid support.

Particularly preferred are removable blocking moieties and deblocking reaction conditions that allow the blocking moiety to be removed in under 10 minutes to produce a hydroxyl group at the 3' position of the 3'-terminal nucleoside. Other preferred removable blocking moieties and deblocking conditions allow the blocking moiety to be removed in less than 8, 7, 6, 5, 4, 3, 2, or 1 minutes,

4. Reactions

In preferred embodiments, the preferred enzyme is TdTase, and specific examples of uses of this enzyme are set forth below. However, the present invention should not be considered limited to the TdTase catalyzed synthesis of DNA and use of other enzymes capable of catalyzing the formation of a 5' to 3' phosphodiester linkage between the 3' hydroxyl group of the substrate and the 5' phosphate of the nucleoside having the removable blocking moiety is contemplated by the present invention. One skilled in the art will understand that enzyme reaction conditions are selected to allow the desired catalysis to occur and may be performed under appropriate conditions, and these conditions are well known in the art.

The reacting is performed typically between 25° C. and 42° C. for an appropriate period of time, typically between about one minute and about 30 minutes. Very short reaction times may be particularly useful if the removable blocking moiety is unstable.

For TdTase catalyzed reactions, the enzymatic conditions, which may serve as the solution in which the substrate is reacted, contains from about 0.20 and about 200 μM of the nucleotide having the removable blocking moiety protecting the 3'-hydroxyl, and from about 0.20 to 200 μM of free and unmodified 3'-hydroxyls derived from the initiating substrate. One particularly preferred buffer contains from about 10 to about 500 mM potassium cacodylate buffer (pH between 6.5 and 7.5), and from about 0:01 to about 10 mM of a divalent cation (e.g. CoCl₂ or MnCl₂). Other buffer compositions and components may be suitable for particular desired embodiment of the present invention.

For example, enzymatic conditions for deoxynucleotidyl transferase include a pH of 6.8 maintained by a potassium cacodylate buffer, 8 mmol/l of MgCl₂, 1 mmol.l of β mercaptoethanol, 0.33 mmol/l of ZnSO₄. One skilled in the art will understand that these enzymatic conditions may vary while still allowing the enzyme to catalyze the desired reaction.

The enzyme capable of catalyzing the formation of 5' to 3' phosphodiester linkages between the 3' hydroxyl group of the initiating substrate and the 5' phosphate of the nucleoside being added is present in a catalytic amount. A catalytic amount of enzyme is typically sufficient to catalyze the formation of phosphodiester bond between greater than 99% of the free 3' hydroxyls of the initiating substrate and the 5' phosphate of the nucleoside within 1 hour. Preferably, the catalytic amount of enzyme and the enzymatic conditions are such that greater than 99% of the free 3' hydroxyls of the initiating substrate are reacted within 10 minutes. In other preferred embodiments, the catalytic amount of enzyme and the enzymatic conditions are such that greater than 90% of the free 3' hydroxyls of the initiating substrate are reacted in less than 5 minutes, for example 4, 3, 2 or 1 minutes. In other preferred embodiments, the catalytic amount of enzyme and enzymatic conditions are such that greater than 99% of the free 3' hydroxyls of the initiating substrate are reacted within 2 minutes.

The TdTase enzyme is present in the buffer at a level between about 1 and 200 units per μL. One unit of TdTase catalyzes the transfer of 1 nmol of dATP to p(dT)₆₋₁₂ in 60 minutes at 37° C. Commercially available forms of TdTase include calf thymus TdTase, available from a variety of suppliers (e.g. Sigma Chemical Co., St. Louis Mo., Promega Corp, Madison, Wis., Gibco-BRL, Gaithersburg, Md.). Calf thymus TdTase may also be prepared by the procedures described by Modak, Biochemistry, 17, 3116-20 (1978), and by Bollum, Fed. Proc. Soc. Exp. Biol. Med. 17, 193 (1958).

While the substrate containing a free and unmodified 3'-hydroxyl group and the mononucleotide having the removable blocking moiety protecting the 3'-hydroxyl group can be reacted in the presence of the TdTase in the buffer solution, the substrate is preferably immobilized on a solid support, and more preferably in a synthesis column to which the buffer solution containing the reaction components is delivered.

After the appropriate incubation time, the enzyme, unreacted mononucleotide, buffer and divalent cation are separated from the initiating substrate. If the reaction was performed using a free and soluble substrate, it can be separated by conventional size exclusion chromatography or similar types of separation techniques including but not limited to ion exchange chromatography and affinity chromatography. For initiating substrates immobilized on solid supports, separation is achieved by washing the support with water or a suitable buffer.

One advantage to the present invention is that the level of unreacted hydroxyl groups on the initiating substrate after the aforementioned enzyme reaction can be exceptionally low, less than 0.1%. This minimizes the necessity for capping unreacted hydroxyl groups. In some embodiments of the present invention it may be desirable to cap the unreacted substrates before the next step in the synthesis cycle. The appropriate chemistry for accomplishing this can be derived from any of the protection strategies described previously but must be permanently affixed during all the subsequent cycles. An example of capping is acetylation by reaction of free 3'-hydroxyls with acetic anhydride and pyridine which would be applicable when acetylation (or other esterifications) are not used as the protecting group on the mononucleotide. Alternatively capping can be accomplished by reaction with t-butyldimethylchlorosilane in acetonitrile and pyridine to form a silyl ether which would be applicable when similar ethers are not used to protect the mononucleotide. In the preferred embodiment, these reactions are intended primarily for modifying immobilized initiating substrates in order to rapidly and efficiently provide appropriate capping conditions.

After the appropriate incubation time, capping reagents are separated from the initiating substrate. If the reaction was performed using a soluble substrate, it can be separated by conventional size exclusion chromatography or similar types of separation techniques including but not limited to ion exchange and affinity chromatography. For initiating substrates immobilized on solid supports, separation is achieved by washing the support with water or a suitable buffer.

The removable blocking moieties protecting the 3' position on the initiating substrate after the reaction may be removed or deblocked (deprotected) to regenerate a free and unmodified 3'-hydroxyl available for addition of the next nucleotide. One skilled in the art will understand that this may be accomplished by either chemical or enzymatic methods. For example, ester protecting groups may be removed using an esterase when R₁ of the ester protecting group discussed above is a suitable substrate for the esterase. Otherwise, the ester linkage may be cleaved by base hydrolysis, which is accomplished by contacting the protected 3'-hydroxyl group with a suitable concentration of base for a sufficient period of time. Cleavage of ester protecting groups has been well studied and appropriate reaction conditions can be readily identified that will cleave the ester but will not cleave the linkage used for capping (e.g. an ether).

The present invention incorporates the unexpected discovery that certain removable blocking moieties, the aromatic 3'-O esters of deoxynucleotide triphosphates, are unstable in commonly used buffers containing divalent cations. The instability is attributable to the presence of both the buffer and the divalent cation, and does not result from the presence of the buffer alone or the cation alone. Buffers destabilizing the ester protecting groups may contain dimethylarsinic acid (cacodylic acid), tris(hydroxymethyl) aminomethane, sodium acetate and sodium phosphate. Divalent cations destabilizing to ester blocking groups include cobalt, manganese and magnesium ions. The toluic acid ester of a deoxynucleotide triphosphate is unstable in a mixture of 1 MM CoCl₂, 100 μM potassium cacodylate, pH 6.8.

Conditions for the removal of removable blocking moieties such as ethers, carbonates, nitrates, and other protecting groups are well studied and many are compatible with the integrity of a polynucleotide chain. Removal of blocking moieties such as phosphate protecting groups, the hydroxyl is regenerated by enzymatic digestion with a phosphatase. For removal of blocking moieties when the protecting group is a sugar moiety, regeneration of the hydroxyl can be accomplished by enzymatic hydrolysis using a glycosidase.

If the removal or deblocking reaction is performed in solution, the deprotection reagents are simply added to the solution. If the reaction is performed with the initiating substrate immobilized on a solid support, then the hydroxyl group regeneration step is performed by washing the solid support with the deprotection reagents. When synthesis columns are utilized to contain the solid support, the hydroxyl group regeneration step is performed by washing the column with the appropriate agents.

After the appropriate period for removal, the initiating substrate (including both those that received an additional nucleotide and those that are capped) is again separated from the other reaction components. If the reaction was performed using a soluble substrate, it can be separated by conventional size exclusion chromatography or similar types of separation techniques including but not limited to ion exchange and affinity chromatography. For initiating substrates immobilized on solid supports, separation is achieved by washing the support with water or a suitable buffer.

As will be appreciated, the above described steps of enzyme catalyzed phosphodiester bond formation between a nucleotide having a removable blocking moiety at its 3' position and an initiating substrate, separation of the initiating substrate from reaction components, capping of unreacted initiating substrate, again separating the initiating substrate from reaction components, removing the removable blocking moiety to regenerate the 3'-hydroxyl group, and again separating the initiating substrate from reaction components are repeated as necessary until the desired object polynucleotide chain is completely synthesized.

Cleavage of a newly synthesized polynucleotide strand from the solid support and/or from the initiating substrate can be accomplished by either chemical or enzymatic reactions. In the case of a chemical reaction, if the initiating substrate terminal nucleoside (containing the free and unmodified 3'-hydroxyl group) is a deoxyguanosine methylated at the 7 position of the base:

Support-dCCCCCCCCCCC-Me⁷ -G-object polynucleotide (SEQ ID NO.1) reaction with 1 M piperidine in water at 90° C. will cleave the chain at this position yielding only the desired polynucleotide in solution. This method can yield a polynucleotide chain containing only the predetermined sequence and can be performed either on immobilized chains (to effect cleavage) or on solution synthesized chains to remove the initiating substrate. Alternatively, the dG^(7me) can be positioned at any location within the initiating substrate or the object polynucleotide where cleavage is desired. Other examples of modified base-specific cleavage of polynucleotide chains have been extensively described in the literature (See, Ambrose and Pless, Meth. Enzymol.,I Vol 152: 522-538.)

Enzymatic removal of the polynucleotide chain may be accomplished by reaction with a specific restriction endonuclease. For example, if the initiating substrate oligonucleotide has the following structure:

Support-dCCCCCCCCCCCCCCCTGCA-3'--OH (SEQ ID NO.2) and the object polynucleotide begins with a G, the resulting newly synthesized chain can be cleaved from the support by reaction with Pst 1 restriction enzyme. This method assumes there are no additional Pst 1 restriction sites in the newly synthesized chain and that one has annealed an appropriate oligonucleotide to the Pst 1 site to render it in a double stranded form for recognition by the enzyme (e.g. an annealing oligonucleotide with the following structure: 3'-dGGGGGGGGGGGGGGGACGT-5'(SEQ ID NO.3) for the example above). Depending on the desired first nucleotide of the object polynucleotide, as well as the ultimate sequence of the polynucleotide, one can choose from a wide variety of restriction enzymes to accomplish the cleavage of only the desired sequence. This method can yield a polynucleotide chain containing only the predetermined sequence and can be performed either on immobilized chains (to effect cleavage) or on solution synthesized chains to remove the initiating substrate. Alternatively, appropriate restriction endonuclease recognition sequences can be positioned at any location within the initiating substrate or the object polynucleotide where cleavage is desired.

The combined initiating substrate and object polynucleotide can be cleaved from the solid support by chemical methods. How the cleavage is performed will depend upon the nature of the initiating substrate and how it was attached to the solid support. Covalent labile bonds, such as for example a trityl group, can be cleaved by washing the support with an appropriate protic acid. Numerous other cleavage strategies have been described. In the case of a non-covalent attachment, as for example avidin-biotin binding, release of the combined substrate and object polynucleotide will occur upon incubation with 8 M guanidine-HCl, pH 1.5.

If the entire synthesis was performed using a soluble initiating substrate, the initiating substrate containing the object polynucleotide can be separated from the various capped oligo- and polynucleotides by conventional chromatographic techniques, such as polyacrylamide gel electrophoresis. Similarly, if the initiating substrate is cleaved from the object polynucleotide by chemical or enzymatic means (e.g. by reaction with piperidine or by restriction endonucleases digestion as described above) conventional chromatography can be used to purify the object polynucleotide.

If the synthesis was performed using an initiating substrate immobilized to a solid support, cleavage from the solid support can be accomplished by either chemical or enzymatic means to retrieve either the combined initiating substrate and object polynucleotide or the object polynucleotide alone. In each instance, the object polynucleotide will be contaminated with capped oligo- and polynucleotides which can be separated from the object polynucleotide by polyacrylamide gel electrophoresis.

An alternative strategy for the synthesis and recovery of the object polynucleotide involves immobilization of the nucleotide. In this instance, the nucleotide is protected at the 3'-hydroxyl by a linker which is attached to a solid support. The linker attachment to the nucleotide can be by an ester or by any of the aforementioned protecting group strategies. Solid supports containing various functional groups (e.g. amines, amides, biotin, avidin, and the like) are generally available and can be adapted to the particular requirements of the nucleotide linker. For example, a nucleotide linker containing a biotin molecule can be bound to agarose using an avidin functional group attached to the agarose.

Using an immobilized nucleotide, the TdTase reaction would join a free initiating substrate, in solution, to the immobilized nucleotide, thereby immobilizing only those initiating substrates which have participated in the enzyme reaction. Initiating substrates which had not participated in the TdTase reaction would be easily removed by rinsing the solid support with an appropriate buffer. Regeneration of the 3' hydroxyl on the initiating substrate is accomplished by the same techniques as described previously.

Subsequent to the regeneration and cleavage step, the initiating substrate is rinsed away from the solid support and separated from the regeneration/cleavage solution containing free nucleotides by conventional techniques such as size exclusion chromatography, ion exchange or affinity chromatography. The next immobilized nucleotide, contained on a new population of solid support particles, is then mixed with the initiating substrate and the appropriate buffers in order to repeat the TdTase coupling reaction.

By immobilizing the nucleotide rather than the initiating substrate, a capping reaction is obviated since the object polynucleotide is separated from unreacted initiating substrate at every cycle. Similarly, if the cleavage reaction fails to release all of the object polynucleotide chains, those polynucleotides which continue to be attached to the solid support are removed prior to the subsequent TdTase reaction.

It is envisioned that various newly synthesized polynucleotide chains will subsequently be joined together by a polymerase/ligase type of reaction in order to form longer polynucleotide sequences that are in a double stranded form. For example, newly synthesized polynucleotides A and B may have the structures depicted below:

    A: 3'--p(dN)--dCCCCCCCCC--5'(SEQ ID NO.4)

    B: 3'--p(dN)--dGGGGGGGGG--5'(SEQ ID NO. 5)

where p(dN) is the predetermined object polynucleotide sequence unique to either the A or B polynucleotide. In the presence of the Klenow fragment of DNA polymerase I, and T4 DNA ligase, as well as the appropriate buffers and nucleotides, a double stranded polynucleotide will be formed in which the two object polynucleotides have been "stitched" together to form the longer double stranded polynucleotide C:

    3' (SEQ ID NO.4)                                                               5' (SEQ ID NO.5)                                                          

This reaction can be performed when one of the polynucleotides is still attached to a solid support or when both polynucleotides have been released into solution by the techniques described previously.

B. Polynucleotides

The present invention contemplates oligonucleotides and polynucleotides produced using the methods of this invention. These polynucleotides preferably have a predetermined nucleotide sequence that was produced by selecting the order in which the individual nucleotides were added to the initiating substrate so when synthesis is completed a polynucleotide having a preselected sequence is produced.

In preferred embodiments, a polynucleotide produced according to the methods of this invention is greater than five nucleotides in length. Polynucleotides produced according to the methods of the present invention may contain large numbers of nucleotides, for example 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200 and greater than 200 nucleotides. The length of a polynucleotide produced using the methods of the present invention may be of a length that is intermediate between the aforementioned nucleotide lengths, such as 5, 15, 16, 25, 26 or any other numeral intermediate between the specific lengths. The length of the polynucleotide produced according to the present invention is limited only by the efficiency of the processes of the present invention.

Polynucleotides produced by the methods of the present invention can contain nucleotide sequences that have a variety of biologic and molecular biologic uses. One skilled in the art will understand the uses for long polynucleotides having a predetermined sequence. For example, many manipulations commonly performed in modern molecular biology could be greatly simplified through the availability of inexpensive, long polynucleotides having a predetermined sequence.

Examples of molecular biology procedures and manipulations that would be simplified using the polynucleotides produced according to the methods of the present invention include cloning and expression of various nucleic acids both in vitro and in vivo. For examples of techniques and manipulations that are simplified using polynucleotides produced using the methods of the present invention see, Methods in Enzymology, Vol. 152 edited by Berger and Kimmel; Maniatis et al., Molecular Cloning a Laboratory Manual, Cold Spring Harbor Press, 1990; Current Protocols in Molecular Biology, edited by Ausubel et al., John Haley and Sons, New York, 1987.

For example, polynucleotides of the present invention could be used to introduce restriction sites into a nucleic acid, to introduce various nucleotide sequences having biological activity such as promoters, and to adjust reading frames. The number of possible applications using oligonucleotide and polynucleotide produced according to the present invention is large as one skilled in the art will understand. Oligonucleotides and polynucleotides produced according to the present invention are especially useful when the application requires long oligonucleotides and polynucleotides.

C. Compositions of Matter

The present invention also contemplates compositions of matter comprising a catalytic amount of a template independent enzyme and a nucleoside 5'-triphosphate having a removable blocking moiety protecting the 3' position of the nucleoside 5' triphosphate and other preferred compositions further comprise an initiating substrate of the present invention.

The present invention contemplates compositions having an amount of template independent enzyme capable of catalyzing the formation of a 5' to 3' phosphodiester linkage between 99 percent of the unprotected 3' hydroxyl groups present on an initiating substrate of the present invention and a nucleoside 5'-triphosphate having a removable blocking group protecting its 3' position within 10 minutes. Other compositions are contemplated that contain an amount of enzyme capable of performing the same reaction to the same extent within 2 minutes.

The compositions contemplated by the present invention includes compositions in which the template independent polynucleotide enzyme present is a template independent polynucleotide polymerase. Examples of preferred template independent polynucleotide polymerases include TdTase and enzymes with similar activities.

The composition of the present invention includes a nucleoside having a removable blocking moiety protecting the 3' position of the nucleoside. Particularly preferred are nucleoside 5'-triphosphates having a removable blocking moiety protecting the 3' position of the nucleoside. The various useful removable blocking moieties are described herein.

In preferred compositions, the nucleoside having the removable blocking moiety protecting the 3' position is present at a concentration of 1 nanomolar to 100 mmolar. In other preferred compositions, the nucleoside is present at a concentration of 1 micromolar to 1 millimolar. In other preferred embodiments, the nucleoside 5'-triphosphate having the removable blocking moiety protecting its 3' position is present at a concentration of 10 times the Km of the enzyme present in the composition.

D. Automated Processes and Apparatus

The present invention contemplates the incorporation of the method described herein in an automated process in an apparatus and in devices. For example, the various buffer and reagent solutions of the inventive process can be provided to synthesis columns containing initiating substrates affixed to solid support matrices by the use of flexible tubing attached to peristaltic pumps or similar devices controlled by a microprocessor programmed to meter the exact quantities of the materials in the correct sequence.

Regardless of the equipment employed, it can be appreciated that the method of the present invention can create single phosphodiester bonds between desired nucleosides with very high efficiency and can potentially be used to produce long chain polynucleotides in high yields.

One example of such an automated process is depicted by a porous frit 13 in a glass or plastic vessel 15 shown in FIG. 3. The insoluble matrix 11 consists of a solid support such as cellulose, SEPHAROSE™ or CPG to which a nucleotide, nucleoside or polynucleotide is covalently linked at the 5'-position of the terminal nucleotide or to which an oligo- or poly- nucleotide or nucleoside having a terminal nucleoside with a free 3'-hydroxyl group is covalently attached via the 5'-hydroxyl group. The matrix II may itself be a covalent component of the porous frit 13 or it may be a separate entity.

The various solutions involved in the synthesis cycle are stored in stock containers 21, 23, 25, 27, 28 and 29. Solutions are introduced into the vessel 15 through tubes 41, 43, 47, 48, 49 and 50 attached to pumps 31, 33, 35, 37, 38, 39 and 40. The composition of the stock solutions would depend on the stability of the various components of the mixture. A simplified automated process would combine many of the various reagents as follows:

The stock containers 21, 23, 25 and 27 contain buffer solutions 51, 53, 55 and 57, respectively, having a concentration between about 10 and about 500 mM of sodium cacodylate (pH 7.0 at 25° C.), between about 0.1 and about 1.0 mM of dithiothreitol. Each buffer solution also contains between about 0.10 and about 200 units per μL of an enzyme (e.g. TdTase) suitable for phosphodiester bond formation. Buffer solution 51 in stock container 21 also contains between about 0.20 and about 200 μM of deoxyadenosine 5'-triphosphate having a blocked 3'-hydroxyl group. Buffer solutions 53, 55 and 57 in stock containers 23, 25 and 27, respectively, contain equivalent concentrations of deoxycytosine 5'-triphosphate, deoxyguanosine 5'-triphosphate and thymidine 5'-triphosphate respectively, each of which also has blocked 3'-hydroxyl groups. Buffer solution 58 in stock container 28 contains an appropriate reagent for deblocking the blocked 3'-hydroxyl groups of the four nucleosides as described previously. Stock solution 59 in stock container 29 contains a suitable neutralization buffer at pH 7.0, such as 0.1 M sodium cacodylate. Stock solution 30 in container 60 contains a suitable enzymatic solution or chemical reagent for releasing the final product from the solid support as described previously.

The various stock solutions are drawn into the tubing, which each feed onto the matrix. Recycling of buffer solutions 51, 53, 55 and 57 from the vessel 15 to their respective stock containers 21, 23, 25 and 27 can be accomplished by way of the tubing 61, 63, 65 or 67. Allocation of fluid to the appropriate tubing can be accomplished by a distribtor, 71, which directs the fluid from the vessel 15. Distributor devices, such as multiport stopcocks and fraction collectors are familiar to one of ordinary skill in the art. Movement of the liquid through tubing which is downstream from the distributor (e.g. 61, 63, 65, 67, 69, 73) can be accomplished by additional pumping as needed (e.g. pump 83, 85, 87, 89, 91, 93). At least one microprocessor controls the peristaltic pumps and distributor so as to provide for the sequential addition and recycling of the nucleotides to form a nucleotide chain having a predetermined nucleotide sequence. In the preferred processes, the initiating substrate linked to the matrix 11 is first exposed to one of the solutions 51, 53, 55, or 57 for a sufficient time to enable attachment of the nucleotide to the initiating substrate. This solution is then recycled into the appropriate container (21, 23, 25, or 27).

The amount of TdTase and 5'-nucleoside triphosphate contained in the buffer solutions 51, 53, 55 and 57 is sufficient for the synthesis of a predetermined quantity of an object nucleotide chain. For example, for the automated synthesis of 1 nmol of a nucleotide chain consisting of 1,000 bases (about 330,000 MW and about 330 μg), each buffer solution will contain an excess of each 5'-nucleoside triphosphate (about 500 nmol) and an excess of TdTase (about 100 to about 1,000 units). Only a small fraction of the buffer solution containing the TdTase and the 5'-nucleoside triphosphate is used for each cycle of nucleotide addition. Matrix 11 is next exposed to solution 58 for a sufficient period of time to remove blocking groups from the growing oligo- or polynucleotide chain. This solution is not recycled but is distributed to tube 69 by the distributor 71, utilizing pump 91.

Matrix 11 is then briefly exposed to solution 59 in order to wash out the deblocking reagent. The next enzyme/nucleotide solution, either 51, 53, 55 or 57, is then added to matrix 11 to continue the cycle.

Finally, after the desire oligo- or polynucleotide is synthesized, cleavage of the object polynucleotide from the solid support occurs by the controlled addition of solution 60 which can be a restriction endonuclease solution or a solution to effect the chemical cleavage from the solid support (e.g., 1 M piperidine) as described above. The microprocessor directs the distributor 71 and pump 93 to move the final product through tube 73 to be recovered for final workup.

As an example of the control of the various reactions by the microprocessor, the synthesis of the oligonucleotide ACGT onto an initiating substrate would involve the sequence of commands shown below. The duration of and between each command is sufficient to allow any particular reaction or fluid movement to proceed adequately.

    ______________________________________                                         Microprocessor Command                                                                            Intended Result                                             ______________________________________                                         1.     Pump 31 on.     Solution 51 added to                                                           vessel 15                                               2.     Pump 31 off.    Nucleotide addition                                                            reaction proceeds                                       3.     Distributor 71 on,                                                                             Recycle reaction fluid                                         pump 83 on.     fluid via tube 61                                       4.     Distributor 71 off,                                                                            Solution 58 added to                                           pump 83 off.    vessel 15; initiate                                                            deblocking reaction                                     5.     Pump 38 off.    Deblocking reaction                                                            proceeds                                                6.     Distributor 71 on,                                                                             Discard deblocking fluid                                       pump 91 on.     via tube 69                                             7.     Pump 39 on.     Neutralize/wash reaction                                                       chamber                                                 8.     Distributor 71 off,                                                                            Solution 53 added to                                           pump 69 off, pump                                                                              vessel 15                                                      33 on.                                                                  ______________________________________                                    

This cycle is repeated for the other nucleotides until the desired sequence is synthesized. When collection of the final product is desired, the microprocessor gives the following commands after step 7 above.

    ______________________________________                                         1.    Pump 40 on.    Solution 60 added to vessel                                                    15                                                        2.    Pump 40 off.   Cleavage reaction of the                                                       initiating substrate proceeds                             3.    Distributor 71 on,                                                                            Collection of synthesized DNA                                   pump 93 on     via tube 73                                               ______________________________________                                    

The alternative strategy envisions the use of immobilized nucleotide triphosphates in order to separate the object nucleotide from non-reacting substrate polynucleotides at every cycle. The automated process using immobilized nucleotide is considerably different from the process involving an immobilized substrate polynucleotide. After the coupling reaction of the triphosphate and the polynucleotide, the eluate contains unreacted polynucleotides, reaction buffer, and TdTase enzyme. The object polynucleotide is attached to the solid support. In order to recycle the enzyme back to its reservoir, the contaminating polynucleotide is first removed by passing the solution through a column containing hydroxyl apatite, for example, or a similar polynucleotide adsorption medium through which the enzyme will pass. This column will have sufficient capacity to adsorb all of the anticipated contaminating polynucleotides produced by every cycle.

After the deblocking step, the object polynucleotide is now contained in a solution of nucleotide triphosphates (with 3'-hydroxyls), and deblocking buffer (e.g., NaoH or phosphatase). These two contaminants can be removed by size exclusion chromatography (e.g., SEPHAROSE™ CL-6B) or by any of a number of commonly used techniques for separating small molecules from oligo- and polynucleotides. An example of this is adsorption of the object polynucleotide by annealing to oligo dA-cellulose column (3'-5') which would simply require the initiating substrate to contain oligo dT. A-T annealing is the preferred embodiment in the automated process since elution of the object polynucleotide can be accomplished by incubation with H₂ O. The annealing of the object nucleotide simply requires the neutralization of the cleavage reaction by addition of a sufficient quantity of HCl or by the inclusion of an appropriate amount of NaCl (.sup.˜ 0.1 -0.5 M) in the deblocking buffers.

An automated process incorporating the immobilized nucleotide triphosphate alternative method of the present invention is depicted in FIG. 4. The process utilizes a nucleotide triphosphate immobilized to a solid support by, but not limited to, techniques describe previously, and compromising stock solutions 151, 153, 155, 157 in stock containers 121, 123, 125, 127. The stock solutions contain a tethered nucleotide, appropriate buffers and sufficient enzyme to effect the synthesis of the desired amount of predetermined sequence. The immobilization material has fluid dynamic properties allowing it to be moved through the various tubes as required. Substances which have these characteristics (e.g. gels and viscous suspensions) are familiar to one of ordinary skill in the art. The reaction vessel, 115, contains a reaction chamber, 111, and a stopcock, 113. Stopcock 113 has three positions A, B, C. Position A aligns a hole of sufficient diameter with the tubing so as to allow the various components of the synthesis to pass unimpeded. Position B aligns a porous frit to which is covalently attached oligonucleotides of deoxyadenosine (dA) approximately 20 bases in length. The quantity of oligo dA is sufficient to anneal the entire quantity of oligo dT, attached to the initiating substrate as described above. In position B, only solutes can pass through and no immobilization material (e.g. those contained in solutions 151, 153, 155, 157). Position C closes all flow. Reaction chamber 111 contains the initiating substrate in water, solution 161. As mentioned above the initiating substrate contains oligo dT which is ≧20 nucleotides in length. Stock containers 121, 123, 125, 127, 128 and 129 are connected to the reaction vessel 115 by way of peristaltic tubing or some similar material to effect transport of the reagents contained in the stock containers. Additionally, vessel 115 is connected to tubing, 181, which contains a distributor, 171 which serves to divert the flow of solutes either to tubing 183 or tubing 185 or recycled back to stock containers after passing through adsorption media (e.g. hydroxylapatite) contained in 130, 132, 134 or 136 via tubes 187, 189, 191 or 193. Tubing 183 feeds back to vessel 115; tubing 185 feeds into a discard container. Solute movement through the tubing is facilitated by pumps, 131, 133, 135, 137, 138, 139, 163 (e.g. peristaltic pumps) or similar devices which will force fluids, gels or viscous suspensions through tubing to desired destinations.

The automated process for synthesis involves the following flow of solutes and stopcock positions controlled by at least one microprocessor. The microprocessor controls pumps, the distributors and the stopcock positions:

1) Stopcock 113, position C (blocked); stopcock 171 in discard position (tube 185); tethered nucleotide, buffers (solution 151, 153, 155, or 157) are combined with substrate oligonucleotide or polynucleotide (solution 161) to yield a tethered oligonucleotide or polynucleotide.

2) Stopcock 113, position B (oligo-dA frit); distributor 171 in recycle position (tube 187, 189, 191 or 193); unreacted polynucleotide is adsorbed in containers 130, 132, 134 or 136; enzyme, buffers are recycled.

3) Stopcock 113, position B (oligo-dA frit); stopcock 171 in discard position (tube 185) ; cleavage buffer (solution 158) added to immobilized polynucleotide yielding a free polynucleotide annealed to the oligo-dA frit; released mononucleotides discarded.

4) Stopcock 113, position B (oligo-dA frit); stopcock 171 in recycle position (tube 183); water (solution 159) is passed through the chamber and frit to release the annealed polynucleotide from the frit and return the polynucleotide to the reaction chamber. The free polynucleotide resides in tubing 183 during Step 5.

5) Stopcock 113, position A (completely open); stopcock 171 in discard position (tube 185); immobilized substrate discarded prior to entry of polynucleotide back into reaction chamber.

6) The final product is recovered via tube 185 with stopcock 113 in position A.

It will be appreciated that for these separation techniques to be effective, the starting oligonucleotide or polynucleotide substrate should consist of at least approximately 20 nucleotides. The composition of the starting oligonucleotide or polynucleotide can be anything that will enable the subsequent purification steps as well as the ultimate cleavage of the object oligonucleotide or polynucleotide from the starting oligo- or polynucleotide. An example of a nucleotide modification that would enable final separation of starting oligonucleotide or polynucleotide from the object polynucleotide is biotinylation of the primary amines of dA, dC, or dG. Additionally, a starting oligonucleotide substrate containing 7-methyl guanosine at the 3' end will provide a cleavage site, as described previously, for ultimate recovery of the object polynucleotide.

Thus, it can be appreciated that, regardless of the equipment employed, the method of the present invention efficiently produces oligonucleotide or polynucleotides in high yield, with a significant reduction in the number of unreacted sequences per cycle. This greatly simplifies the ultimate isolation of the object nucleotide chain for further experimentation. Once isolated, the nucleotide chain may be "stitched" together with other polynucleotides and formed into double stranded DNA as described above or may be amplified by conventional means such as by polymerase chain reactions for use in recombinant DNA end use applications.

E. Kits

The present invention also contemplates a kit for carrying out the present inventive procedure. Typically, a kit would contain all the solutions and substances needed for performing the instant synthesis procedure together with instructions for carrying out the procedure. A typical kit for carrying out the claimed process would include an initiating substrate of the present invention, various nucleoside 5' triphosphates of the present invention having a removable blocking moiety protecting the 3' position, an enzyme of the present invention capable of catalyzing the formation of a 5' to 3' phosphodiester linkage between the unprotected 3' hydroxyl group of the initiating substrate and the 5' phosphate of the blocked 5'-triphosphate. Additional components and solutions optionally included in the kit are various required reaction solutions and reaction buffers, reaction vessels in which to perform the assay, deblocking chemicals, solutions or enzymes of the present invention.

A kit for carrying out the instant synthesis may also contain initiating substrates that are attached to a solid support. The kit may contain a variety of initiating substrates attached to solid supports, so that the first nucleoside of a desired oligonucleotide can be selected by selecting the appropriate initiating substrate.

In other kits for carrying out the present process, initiating substrates having oligonucleotides of a preselected nucleotide sequence are provided to allow oligonucleotides and polynucleotides having this preselected nucleotide sequence incorporated into its 5' to be produced. Kits with this type of initiating substrate can provide easy synthesis of oligonucleotides having, for example, a restriction endonuclease cleavage site present in its nucleotide sequence.

Other kits contemplated by the present invention include initiating substrates having various derivatized nucleotides, nucleoside analogs, or non-nucleoside molecules that allow oligonucleotides produced using those initiating substrates to have useful properties such as easily coupling to other molecules, unique biologic activity or other unique features. Other kits would have an initiating substrate of the present invention such as double-stranded oligonucleotides.

The present invention also contemplates kits for producing nucleoside 5'-phosphate and nucleoside analogs having a removable blocking moiety protecting its 3' position. These kits would allow a user to produce nucleoside 5' triphosphates and equivalents that are useful in the practicing of the present invention.

The present invention also contemplates kits that contain additional components for carrying out other molecular biologic procedures in conjunction with the methods of the present invention. For example, components of the present invention may be present in a kit that contains vectors and concomitant cell lines for expression of a protein or enzymes for desired modifications of amplification of the nascent or fully synthesized object polynucleotide.

The following examples further illustrate the present invention, and are not to be construed as limiting the scope thereof. Unless otherwise indicated, materials were obtained from Promega, Fisher, Aldrich, Sigma, Pharmacia, Gibco-BRL, Bio-Rad and New England Biolabs. All parts and percentages are by weight unless expressly indicated to be otherwise, and all temperatures are in degrees Celsius.

EXAMPLES Example 1 Synthesis of protected nucleotides.

A. Synthesis of protected nucleotides by reaction of the 3' hydroxyl with carboxylic acids.

i. Toluic acid. One hundred μL of 1 M toluic acid (either the para or ortho isomer) in anhydrous N,N-dimethylformamide (DMF) was mixed, in a nitrogen atmosphere, with 100 μL of 1 M carbonyldiimidazole, also in anhydrous DMF. Formation of the imidazolide was allowed to proceed at room temperature for 30 seconds. To this mixture was added 100 μL of a 50 mM solution of deoxynucleoside 5'-triphosphate in water. Formation of the toluoyl-dNTP ester proceeded at room temperature for 12 hours.

The triphosphates (including both 3'-hydroxy unreacted triphosphates and the 3'-toluoyl triphosphates) were separated from the other reaction components by precipitation in the presence of 9 volumes of acetone. The insoluble nucleoside triphosphates were recovered by centrifugation and removal of the soluble components. The nucleosides were then redissolved in 100 μL of water, toluoyl ester was separated from the starting nucleotide by chromatography on Whatman 3MM cellulose paper which had been prewashed first in isopropanol, butanol, and water in the proportion of 2:2:3 by volume, and then in water alone, prior to drying. The solvent to achieve separation by ascending chromatography contained isopropanol, butanol, and water also in the proportion of 2:2:3 by volume. Detection of the various separated components was by ultraviolet light absorption at 254 nm. The dNTP-3'-O-toluate was cut from the paper and eluted into water. After concentration to dryness in vacuo, the nucleotide ester was redissolved in water to a final concentration of .sup.˜ 0.1-1 mM. This material was then subjected to mass spectroscopic analysis to confirm the structure. The predicted mass numbers for the toluoyl esters of dATP, dCTP, dGTP, and TTP are 608, 584, 624, and 599 respectively. In each case these mass numbers were observed. These mass numbers were not observed in the spectra obtained from the unprotected deoxynucleoside triphosphates.

In related experiments, a variety of esters have been formed from carboxylic acids to yield aromatic or aliphatic protecting groups at the 3' position.

ii. Benzoic acid and dimethylbenzoic acid. Benzoic acid as well as the 2,6- and 3,5-dimethylbenzoic acid isomers were esterified to nucleotide triphosphates by the same methods described above in order to evaluate position effects of methyl groups on the overall kinetics of subsequent enzyme reactions.

iii. 4-Nitrobenzoic acid. Esterified to the 3'-hydroxyl using the same methods.

iv. 2-Napthoic acid. Esterified to the 3'-hydroxyl using the same methods.

v. Isovaleric acid. Esterified to the 3'-hydroxyl using the same methods.

Depending on the particular stability requirements, the procedures are readily adaptable to the utilization of virtually any carboxylic acid for esterification and protection of the 3'-hydroxyl of a nucleotide triphosphate.

B. Synthesis of protected nucleotides by reaction of the 3'-hydroxyl with an ether.

2.5 mg of deoxynucleoside triphosphate was dissolved in 100 μL anhydrous DMSO containing 5.2 mg para-toluene sulfonic acid. The solution was cooled to 0° C.; 200 μL of ethyl vinyl ether was then added and allowed to react for 3 minutes. 200 μL of 1 M Tris-Cl, pH 9.0 was then added with vigorous shaking resulting in the formation of two liquid phases. The ether phase was discarded and to the aqueous phase was added 10 volumes of absolute ethanol to precipitate the nucleotides. After incubation for 10 minutes at -20° C. the nucleotide pellet was obtained by centrifugation, and was redissolved in 0.25 M NaCl followed by the addition of 10 volumes of ethanol. The final precipitated nucleotide pellet was dissolved in 100 μL of water and applied to Whatman 3MM paper for chromatography to separate the nucleotide ether from unreacted nucleotides. Ascending chromatography was performed as described above with a solvent of isopropanol, butanol, and water in the proportion of 2:2:3 by volume. The nucleotide ether, where the protecting group is an ethoxyethyl moiety, migrates with an R_(f) (relative to the unreacted nucleotide) of 1.25. This species was cut out of the paper and eluted with water to yield the purified derivative.

C. Synthesis of protected nucleotides by phosphorylation of the 3'-hydroxyl.

i. Chemical synthesis. 2.0 mg of deoxynucleoside triphosphate was dissolved in 60 μL anhydrous DMSO, 2 μL orthophosphoric acid, and 6 μL triethylamine. To start the reaction, 6 μL of trichloroacetonitrile was added and the mixture was incubated at 37° C. for 30 minutes. The reaction was cooled to room temperature and 5 μL of 5 M NaCl was added followed by 1.4 mL of acetone. The precipitation of nucleotide was allowed to proceed at -20° C. for 10 minutes; nucleotide was recovered by centrifugation, redissolved in 100 μL 0.25 M NaCl and reprecipitated by the addition of 1.4 mL of absolute ethanol. The final nucleotide was recovered by centrifugation, dissolved in 100 μL of water and applied to Whatman 3MM paper. Separation of nucleotide tetraphosphate (5'-triphosphate, 3'-monophosphate) was by ascending chromatography in isopropanol, butanol, and water in the proportion of 2:2:3 by volume. The nucleotide tetraphosphate migrates with an R_(f) (relative to the unreacted nucleotide) of 0.90.

Alternatively, the same phosphorylation reaction components (phosphoric acid, teithylamine, and deoxynucleotide) can be dissolved in formamide and the reaction allowed to proceed at 70° C.

Alternative chromatography solvents include 1-propanol, concentrated ammonia, water (55:20:25), in which case the Rf of the tetraphosphate or 3mm paper is approximately 0.8 relative to unreacted triphosphate.

ii. Enzymatic synthesis. Deoxynucleoside 3'-monophosphates (Sigma) were phosphorylated at the 5' position using polynucleotide kinase. The reaction was performed at pH 9.0 to minimize the inherent 3' phosphatase activity of the enzyme, in a solution consisting of 50 mM Tris-Cl (pH 9.0), 10 MM MgCl₂, 1.5 mM spermine, 5 mM dithiothreitol, 3 mM 3'-dNMP, 30 mM ATP, and 20 units of polynucleotide kinase (Sigma or Pharmacia) in a final volume of 200 μL for 16 hours at room temperature. The phosphorylation was monitored by chromatography (Whatman 3MM paper) after removal of the ATP by chromatography through Affi-Gel 601 (Bio-Rad).

The nucleoside 5'-monophosphate 3'-monophosphate was further phosphorylated at the 5' position using nucleoside monophosphate kinase and pyruvate kinase in a solution containing 50 mM Tris-Cl (pH 7.4), 10 mM MgCl₂, 1.5 mM spermine, 5 mM dithiothreitol, 30 mM ATP, 4 mM phosphoenolpyruvate, 10 mM KCl 150 μg/mL pyruvate kinase (Sigma), and 100 μg/mL nucleoside monophosphate kinase (Boehringer Mannheim). The reaction proceeded at room temperature for 30 minutes (for dA) to 4 hours (for dT, dC, dG). The deoxynucleotides were again separated from ATP by chromatography on Affi-Gel 601 followed by concentration to dryness in vacuo and dissolution in 200 μL of water. Purification of the tetraphosphate from other nucleotides was by paper chromatography as described above.

D. Synthesis of a benzoylated nucleotide tethered to agarose beads.

One hundred μL of 1 M p-aminobenzoic acid in 10 mM sodium hydroxide, 90% DMF, pH 10, was mixed with 100 μL of 1 M N-succinimidyl 3-(2-pyridylthio)propionate also in basic DMF. Coupling of the succinimidyl to the amine was allowed to proceed at room temperature for four hours. The reaction was monitored and the coupled product purified by thin layer chromatography on silica gel using a mixture of chloroform and methanol as the solvent. Silica gel containing the coupled product was extracted with DMF, filtered, and added to an equal volume (.sup.˜ 100 μL) of 1 M carbonyl diimidazole in anhydrous DMF followed immediately by the addition of 100 μL of 50 mM deoxynucleoside triphosphate. The product, a dNTP coupled by an ester linkage to a tether containing an amide and a disulfide bond, was treated with 2-mercaptoethanol in a nitrogen atmosphere to expose the sulfhydryl, and subsequently purified by purification from 10 volumes of absolute ethanol. The purified product, under nitrogen atmosphere, was then incubated with 0.2 mL Affi-Gel™ 501, an organomercurial crosslinked agarose, in 50 mM sodium phosphate, pH 6, at room temperature for one hour to allow covalent mercaptide bonds to form.

Example 2 Regeneration of the 3' hydroxyl from protected nucleotides.

A. Deprotection of nucleotide 3'-O esters. The ester protecting groups were removed from the 3'-hydroxyl of dNTPs by incubation in 1 mM CoCl₂ and 100 mM potassium cacodylate, pH 6.8. The conversion of the ester to the hydroxyl was evaluated by cellulose and paper chromatography and was found to be nearly quantitative after 15 minutes of incubation at 37° C. These deprotection conditions were equally effective for each of the four nucleotides. Further evaluation revealed that the instability was due to both the buffer and the divalent cation.

The instability of the toluoyl esters in other commonly used TdTase coupling reaction buffers was explored. The relative degree of instability due to the various buffers (in the presence of 1 mM CoCl₂) was found to be cacodylate>tris(hydroxymethyl) aminomethane>sodium acetate or phosphate. Instability due to the cations (in cacodylate buffer) was found to be Co⁺⁺ >Mn⁺⁺ >Mg⁺⁺ Degradation of the esters was first observable after about three minutes of incubation. Incubation in buffer alone or cation alone produced no observable degradation. In common with many other types of esters, these deoxynucleotide esters were also sensitive to basic conditions e.g. incubation in 10-100 mM NaOH. The other esters of dNTPs (isovaleroyl, dimethylbenzoyl, napthoyl, and nitrobenzoyl) were also relatively unstable in the buffered divalent cations.

In general, these results identify unexpected properties of the esters of dNTPs and provide a convenient and gentle method for the rapid removal of these esters from the growing polynucleotide chain after a coupling reaction. These deprotection conditions are sufficiently gentle to enable the synthesis of an object polynucleotide onto a pre-existing double stranded DNA without denaturation of the DNA at every cycle. This capacity for "add-on" synthesis using a double-stranded polynucleotide as the initiating substrate is demonstrated in Example 5 below.

B. Deprotection of nucleotide 3'-O ethers. The 3' ethoxy ethyl ether of the nucleoside triphosphates were stable in cobalt containing buffers but could be readily removed by incubation in 5% acetic acid at room temperature or by incubation in 0.5 N HCl/THF at 0° C.

C. Deprotection of nucleotide 3'-phosphates. The 3'-phosphate was specifically removed by incubation of the nucleotide in a solution containing 50 mM sodium acetate, pH 5.5, 10 mM MgCl₂, and 20 units of nuclease P1, an enzyme which specifically removes phosphates from the 3' position of mononucleotides. The reaction was allowed to proceed for 90 minutes at 37° C. This enzyme would not be appropriate in the case of a protected nucleotide attached to an initiating substrate since it is also a phosphodiesterase. In this case an alternative phosphatase can be used, which is described below.

Example 3 Efficiency of enzyme catalyzed phosphodiester bond formation using protected deoxynucleotidyl triphosphates.

The ability of a polymerizing enzyme, TdTase, to catalyze the creation of a phosphodiester bond between an initiating polynucleotide substrate and a 3'-O-protected deoxynucleotidyl triphosphate was measured using a transferase/ligase assay. In this assay, transfer of a nucleotide to an initiating substrate DNA, such as a linearized vector, will inhibit the ability of the vector to be relegated into a circular form. The relative quantity of circular vector DNA in each reaction can then be measured by bacterial transformation.

100 μM deoxynucleotide 3' ethoxy ethyl ether, or 3' phosphate were incubated with Pst 1-digested Puc 8 vector DNA (1 μg) in the presence of 1 mM CoCl₂, 0.1 mM DTT, potassium cacodylate, pH 6.8, and 40 units TdTase (Promega) in a total volume of 25 μL. In the case of the 3' ester, the same reaction was performed with the exception that the CoCls was replaced with MnCl2 and the cacodylate buffer was replaced with Tris-Cl. The reactions were allowed to proceed at 37° C. for 15 minutes at which time they were terminated by the addition of 1 μL of 100 mM Na₂ EDTA, 0.1% sodium dodecyl sulfate. The Puc 8 DNA was separated from the other components of the reaction by chromatography through aqueous packed Sepharose™ CL-6B and was then used in a ligase reaction. The ligation reaction consisted of the Puc 8 DNA, 1 mM Na₂ ATP, 50 mM Tris-Cl, pH 8.0, 1 mM MgCl₂, 100 μg/mL bovine serum albumin and 100 units of T4 DNA ligase (New England Biolabs). The ligation reaction was allowed to proceed at 16° C. for 18 hours. The Puc 8 DNA was again recovered by Sepharose™ CL6B chromatography.

The inhibition of the ligation reaction due to the addition of a nucleotide to the Puc 8 DNA by TdTase was quantified by a bacterial transformation assay. Competent E. Coli JM109 bacteria (Promega) were incubated with 100 ng of the Puc 8 DNA according to the instructions provided with the transformation competent cells. Briefly, this involved a heat shock of the admixture for one minute at 42° C., incubation of the bacteria in LB broth at 37° C. for one hour, and overnight growth of the bacteria on LB agar Petri plates containing 50 μg/mL ampicillin. Colonies from each transformation were then counted.

    ______________________________________                                         ,                                 Trans                                        dNTP in TdTase                                                                              Vector               formed                                       Reaction     Substrate   Religation                                                                              Colonies                                     ______________________________________                                         none         Pst 1 - Puc 8                                                                              yes      1,381                                        (positive control)                                                             none         "           no       325                                          (background)                                                                   dideoxy-ATP  "           yes      342                                          dATP 3'-0 toluate                                                                           "           yes      316                                          dATP 3'-0 ether                                                                             "           yes      330                                          dATP 3'-phosphate                                                                           "           yes      340                                          dATP-3'OH    "           yes      636                                          ______________________________________                                    

The results demonstrate that the protected nucleotides are utilized by TdTase for the creation of phosphodiester bonds. The covalent attachment of the protected nucleotide to the vector DNA blocks the vector from religation. In the case of the unprotected nucleotide (dATP 3'-OH) the enzyme may be predominantly adding homopolymer tails to a population of vector molecules leaving some vectors unmodified.

The efficiency of the TdTase catalyzed transfer, as measured by the numbers of colonies in excess of the background value, were comparable when comparing the protected mononucleotide with dideoxynucleotide. The absence of transformed colonies above the background value compared to a control which produced greater than 1000 colonies, indicates a TdTase catalyzed transfer of protected mononucleotide to ≧99.9% of the initiating substrate 3' hydroxyls.

Example 4 Inhibition of phosphodiester bond formation by protected nucleotides.

Attachment of a protected mononucleotide to vector DNA will prevent the subsequent attachment of a biotin labelled nucleotide so long as the protecting group is affixed to the 3'-hydroxyl. This inhibition of vector biotinylation can be readily quantified by blotting assays after agarose gel electrophoresis.

Vector DNA (either Puc 8 or pBluescript) digested with the appropriate restriction enzyme, was reacted with approximately 100 μM protected nucleotide for varying times in the presence of 25 units TdTase (Promega or BRL) in appropriate buffers in a final volume of 25 μL. To the reaction mix was then added 1 μL of 300 μM biotinylated dUTP (Sigma or Boehringer) and the reaction was allowed to proceed for 1-3 minutes at which point the reaction was stopped by the addition of 1 μL of 1% sodium dodecylsulfate, 50 mM Na₂ EDTA. The mixture was heated to 75° C. for 1 minute then electrophoresed in an agarose gel to visualize the DNA. In related assays, the vector DNA was purified from the other components of the reaction prior to the addition of biotinylated nucleotide. Purification was by centrifugal chromatography on Sepharose CL-6B. This step was included to avoid the possibility that low molecular weight inhibitors were slowing the activity of the TdTase.

The incorporation of biotin into the DNA was measured using a standard dye reaction procedure. The DNA was first blotted onto a piece of nitrocellulose paper. The nitrocellulose paper with the DNA adhering to it was then heated to 80° C. in a drying oven for 30 minutes and re-wetted in 25 mL of 50 mM Tris-Cl pH 8.1, 150 mM NaCl, 0.1% Triton-X-100 (TBST) and 10% (w/v) Carnation non-fat dry milk, a solution which is intended to enhance the contrast of the final dye reaction. After 1 hour of incubation in the milk solution, a fresh solution of TBST containing approximately 1 μg/mL of streptavidin alkaline phosphatase (Fisher Scientific #OB5000-ALPH) was added to the paper. Binding of streptavidin to biotin proceeded for 1 hour at room temperature. The paper was then transferred to 25 mL of fresh TBST for 10 minutes to wash off excess streptavidin-alkalin phosphatase. This washing step was repeated four times. The paper was then transferred to 10 mL of 100 mM Tris-Cl, pH 9.5, 150 mM NaCl, 5 mM MgCl2, 300 μg/mL nitrotetrazolium blue and 150 μg/mL bromochloroindolyl phosphate to visualize the quantity of bound streptavidin phosphatase by enzymatic release of the chromophoric bromochloro indole.

The results of the inhibition assays using a variety of blocking groups is summarized below.

    ______________________________________                                         3' Protecting                                                                  group       Biotinylation                                                                               Reaction                                              (%)         time (min)   time (min)                                                                              Inhibition                                   ______________________________________                                         para-toluoyl                                                                               0.5-5        0.5-5    >50%                                         benzoyl     "            "        "                                            isovaleroyl "            "        "                                            dimethylbenzoyl                                                                            "            "        "                                            ethoxyethyl "            "        "                                            phosphate   "            "        "                                            ______________________________________                                    

Example 5 DNA synthesis using protected dNTPs: synthesis of a new restriction site in the Puc 8 vector.

To demonstrate the synthesis of a desired DNA sequence directly onto a vector DNA by the TdTase catalyzed addition of the protected dNTPs, we performed sequential reactions on Pst 1-digested Puc 8 DNA in order to introduce a new restriction site into the vector. The sequence at the termini of the Pst1 Puc 8 DNA is:

                    5'G-----------------------CTGCA3' (SEQ ID NO.6)                                        Puc8                                                   (SEQ ID NO. 6) 3'ACGTC-----------------------G5'                          

where the dotted lines indicate the annealed complementary strands of the vector. Sequential coupling and cleavage reaction were performed using the toluoyl esters of dNTPs as follows:

First coupling reaction--100 mM potassium cacodylate, pH 6.8, 1 mM CoCl₂, 0.1 mM DTT, 0.1 mg/mL BSA, 100 μM dTTP-3'O-toluate, 40 units TdTase (Promega), 37° C., 2 minutes.

Stop reaction--1 μL 100 mM Na₂ EDTA, 1 μL 10% sodium dodecyl sulfate, 65° C., 2 minutes.

DNA recovery--Centrifugation through 0.5 mL packed Sepharose™ CL-6B in water.

Ester cleavage reaction--100 mM potassium cacodylate, pH 6.8, 1 mM CoCl₂, 0.1 mM DTT, 0.1 mg/mL BSA, 37° C., 30 minutes.

Second coupling reaction--100 μM dGTP-3'O-toluate, 40 units TdTase (Promega), 37° C., 2 minutes.

Repeat stop, recovery and cleavage.

Third coupling reaction--100 μM dCTP-3'O-toluate, 40 units TdTase (Promega), 37° C., 2 minutes.

Repeat stop, recovery and cleavage.

Fourth coupling reaction--100 μM dATP-3'O-toluate, 40 units TdTase (Promega), 37° C., 2 minutes.

Repeat stop, recovery and cleavage.

Final recovery of DNA--Centrifugation through 0.5 mL packed Sepharose™ CL-6B in water.

A similar series of reaction were performed using the 3'-phosphates of the dNTPs with some modifications.

First coupling reaction--100 mM potassium cacodylate, pH 6.8, 1 mM CoCl₂, 0.1 mM DTT, 0.1 mg/mL BSA, 100 μM dTTP-3'-phosphate, 40 units TdTase (Promega), 37° C., 2 minutes.

Stop reaction--1 μL 100 mM Na₂ EDTA, 1 μL 10% sodium dodecyl sulfate, 65° C., 2 minutes.

DNA recovery--Centrifugation through 0.5 mL packed Sepharose™ CL-6B in water.

Phosphate cleavage reaction--0.1 m Tris.Hcl, pH 9.0, 0.1 m Nacl, 10 mM MgCl₂, and 20 units of alkaline phosphatase, 37° C., 30 minutes.

Second coupling reaction--100 μM dGTP-3'-phosphate, 40 units TdTase (Promega), 37° C., 2 minutes.

Repeat stop, recovery and cleavage.

Third coupling reaction--100 μM dCTP-3'-phosphate, 40 units TdTase (Promega), 37° C., 2 minutes.

Repeat stop, recovery and cleavage.

Fourth coupling reaction--100 μM dATP-3'phosphate, 40 units TdTase (Promega), 37° C., 2 minutes.

Repeat stop, recovery and cleavage.

Final recovery of DNA--Centrifugation through 0.5 mL packed Sepharose™ CL-6B in water.

The modified vector DNA was intended to have the following new DNA sequence:

                             5'G----------------------CTGCATGCA3' (SEQ ID          NO.7)                                                                                                                Puc8                                     (SEQ ID NO.7) 3'ACGTACGTC-----------------------G5'                       

To demonstrate the presence of this new sequence in the vector, the modified Pst 1-Puc 8 was religated as previously described for 18 hours at 16° C. The resulting recircularized or concatemerized plasmid would have the following new structure in the Puc 8 polylinker:

    5'--CTGCATGCAG--3'(SEQ ID NO.8)

    3'--GACGTACGTC--5'(SEQ ID NO.8)

where the underlined portion is the recognition sequence for the Sph 1 restriction enzyme, which did not previously exist in the vector.

The relegated vector was passed through a CL-6B spun column and incubated in the Sph 1 restriction enzyme buffer and 10 units of Sph 1 (New England Biolabs). Agarose gel electrophoresis revealed that the original Puc 8 DNA contained no Sph 1 recognition sequences and that the recovered DNA after the TdTase reactions contained the desired sequence.

To demonstrate the significance of the blocking groups, an identical protocol was followed using unblocked nucleotide triphosphates in the synthesis reactions. The final religation product contained no detectible Sph 1 sequences.

The foregoing examples and description of the preferred embodiment should be taken as illustrating, rather than as limiting, the present invention as defined by the claims. As will be readily appreciated, numerous variations and combinations of the features set forth above can be utilized without departing from the present invention as set forth in the claims. All such modifications are intended to be included within the scope of the following claims.

    __________________________________________________________________________     #             SEQUENCE LISTING                                                 - (1) GENERAL INFORMATION:                                                     -    (iii) NUMBER OF SEQUENCES:  8                                             - (2) INFORMATION FOR SEQ ID NO: 1:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #    12   (A) LENGTH:                                                          #      nucleic acid                                                            #single   (C) STRANDEDNESS:                                                    #   linear(D) TOPOLOGY:                                                        -     (ix) FEATURE:                                                            #base number 12 is m7gFORMATION:                                               #1:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #       12                                                                     - (2) INFORMATION FOR SEQ ID NO: 2:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #    19   (A) LENGTH:                                                          #      nucleic acid                                                            #single   (C) STRANDEDNESS:                                                    #   linear(D) TOPOLOGY:                                                        #2:   (ii) SEQUENCE DESCRIPTION: SEQ ID NO:                                    # 19               GCA                                                         - (2) INFORMATION FOR SEQ ID NO:    3:                                         -      (i) SEQUENCE CHARACTERISTICS:                                           #    20   (A) LENGTH:                                                          #      nucleic acid                                                            #single   (C) STRANDEDNESS:                                                    #   linear(D) TOPOLOGY:                                                        #3:   (ii) SEQUENCE DESCRIPTION: SEQ ID NO:                                    # 20               GGGG                                                        - (2) INFORMATION FOR SEQ ID NO:    4:                                         -      (i) SEQUENCE CHARACTERISTICS:                                           #    9    (A) LENGTH:                                                          #      nucleic acid                                                            #single   (C) STRANDEDNESS:                                                    #   linear(D) TOPOLOGY:                                                        #4:   (ii) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #          9                                                                   - (2) INFORMATION FOR SEQ ID NO:    5:                                         -      (i) SEQUENCE CHARACTERISTICS:                                           #    9    (A) LENGTH:                                                          #      nucleic acid                                                            #single   (C) STRANDEDNESS:                                                    #   linear(D) TOPOLOGY:                                                        #5:   (ii) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #          9                                                                   - (2) INFORMATION FOR SEQ ID NO:    6:                                         -      (i) SEQUENCE CHARACTERISTICS:                                           #    5    (A) LENGTH:                                                          #      nucleic acid                                                            #single   (C) STRANDEDNESS:                                                    #   linear(D) TOPOLOGY:                                                        #6:   (ii) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #             5                                                                - (2) INFORMATION FOR SEQ ID NO:    7:                                         -      (i) SEQUENCE CHARACTERISTICS:                                           #    9    (A) LENGTH:                                                          #      nucleic acid                                                            #single   (C) STRANDEDNESS:                                                    #   linear(D) TOPOLOGY:                                                        #7:   (ii) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #          9                                                                   - (2) INFORMATION FOR SEQ ID NO:    8:                                         -      (i) SEQUENCE CHARACTERISTICS:                                           #    10   (A) LENGTH:                                                          #      nucleic acid                                                            #single   (C) STRANDEDNESS:                                                    #   linear(D) TOPOLOGY:                                                        #8:   (ii) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #        10                                                                    __________________________________________________________________________ 

We claim:
 1. A method for synthesizing a polynucleotide of a predetermined sequence, comprising:(a) providing an initiating substrate comprising a nucleoside having an unprotected 3'-hydroxyl group; and (b) reacting said initiating substrate with a nucleoside 5'-triphosphate having its 3' position protected with a removable blocking moiety, wherein said nucleoside 5'-triphosphate is selected according to the order of said predetermined sequence, in the presence of an enzyme that catalyzes the formation of a 5' to 3' phosphodiester linkage between said unprotected 3'-hydroxyl group of said initiating substrate and a 5'-phosphate of said nucleoside 5'-triphosphate, so as to add said nucleoside to said initiating substrate.
 2. The method of claim 1, wherein said enzyme is a template-independent polynucleotide polymerase.
 3. A method as in claim 1 or 2 further comprising:c) removing the blocking moiety protecting the 3' position of said nucleoside 5'-triphosphate to produce an initiating substrate having an unprotected 3'-hydroxyl group.
 4. The method of claim 3 further comprising repeating steps (b) and (c) at least once.
 5. The method of claim 3 further comprising repeating the steps (b) and (c) until the polynucleotide having the predetermined sequence is obtained.
 6. A method as in claim 1 or 2, wherein said initiating substrate is selected from the group consisting of ribonucleosides, deoxynucleosides, nucleotides, and single and double stranded oligonucleotides and polynucleotides.
 7. A method as in claim 1 or 2, wherein said initiating substrate further comprises oligonucleotide sequences.
 8. The method of claim 7, wherein said oligonucleotide sequences are attached to non-nucleoside molecules.
 9. A method as in claim 1 or 2, wherein said initiating substrate is immobilized on a solid support.
 10. The method of claim 9, wherein said solid support is selected from the group consisting of cellulose, controlled-pore glass, silica, polystyrene, styrene divinyl benzene, agarose and crosslinked agarose.
 11. The method of claim 2, wherein said template-independent polynucleotide polymerase is terminal deoxynucleotidyl transferase.
 12. A method as in claim 1 or 2, wherein said removable blocking moiety is removed in under 10 minutes to produce a hydroxyl group at the 3' position of the 3'-terminal nucleoside.
 13. The method of claim 12, wherein said removable blocking moiety is removed in under 2 minutes to produce a hydroxyl group at the 3' position of the 3'-terminal nucleoside.
 14. A method as in claim 1 or 2, wherein said removable blocking moiety is selected from the group consisting of esters, ethers, carbonitriles, phosphates, phosphoramide, carbonates, carbamates, borates, nitrates, sugars, phosphoramidates, phenylsulfenates, sulfates, and sulfones, wherein said removable blocking moiety is linked to the 3' carbon of said nucleoside 5'-triphosphate.
 15. A method as in claim 1 or 2, wherein said removable blocking moiety is selected from the group consisting of an ester, phosphorous containing moiety and an ether.
 16. The method of claim 15, wherein said ester is selected from the group consisting of toluoyl ester, isovaleroyl ester, benzoyl ester, 4-nitrobenzoyl esters 2,6 dimethylbenzoyl ester, 3,5 dimethylbenzoyl ester and dimethylbenzoyl ester.
 17. The method of claim 15, wherein said ether is selected from the group consisting of bis(2-chloroethoxy)methyl ether, 4-methoxytetrahydropyranyl ether, tetrahydrafuranyl ether, 1-ethoxyethyl ether, tri(p-methoxyphenyl)methyl ether, di(p-methoxy)phenylmethyl ether, t-butyldimethylsilyl ether.
 18. The method of claim 15, wherein said phosphorous containing moiety is selected from the group consisting of phosphate, phosphoramidate and phosphoramide.
 19. A method as in claim 1 or 2, further comprising treating said nucleoside 5'-triphosphate having said removable blocking moiety with a deblocking solution whereby said removable blocking moiety is removed.
 20. The method of claim 19, wherein said deblocking solution comprises a divalent cation.
 21. The method of claim 20, wherein said divalent cation is Co⁺⁺.
 22. The method of claim 19, wherein said deblocking solution comprises a buffer selected from the group consisting of dimethylarsinic acid, tris[hydroxymethyl] amino methane and 3-[m-morpholine] propianosulphonic acid.
 23. The method of claim 19, wherein said deblocking solution comprises an enzyme that catalyzes the removal of said removable blocking moiety.
 24. The method of claim 19, wherein said treating occurs in under 10 minutes.
 25. The method of claim 24, wherein said treating occurs in under 2 minutes.
 26. A method as in claim 1 or 2, wherein said removable blocking moiety is linked to a solid support.
 27. The method of claim 26, further comprising cleaving said polynucleotide from said solid support.
 28. The method of claim 27, wherein said cleaving produces a polynucleotide having a 3'-hydroxyl group at its 3'terminus.
 29. The method of claim 26, wherein said removable blocking moiety linked to said solid support is selected from the group consisting of esters, ethers, carbonitriles, phosphates, carbonates, carbamates, borates, nitrates, sugars, phosphoramide, phosphoramidates, phenylsulfenates, sulfates, sulfones and amino acids, wherein said removable blocking moiety is linked to the 3'position of said nucleoside 5'-triphosphate and is also linked to said solid support. 